Sparse Bayesian hierarchical modeling of high-dimensional clustering problems
نویسندگان
چکیده
منابع مشابه
Sparse Bayesian hierarchical modeling of high-dimensional clustering problems
Clustering is one of the most widely used procedures in the analysis of microarray data, for example with the goal of discovering cancer subtypes based on observed heterogeneity of genetic marks between different tissues. It is wellknown that in such high-dimensional settings, the existence of many noise variables can overwhelm the few signals embedded in the high-dimensional space. We propose ...
متن کاملBayesian Sparse Learning for High Dimensional Data
Statistical Science) Bayesian Sparse Learning for High Dimensional Data by Minghui Shi Department of Statistical Science Duke University
متن کاملBayesian Hierarchical Cross-Clustering
Most clustering algorithms assume that all dimensions of the data can be described by a single structure. Cross-clustering (or multiview clustering) allows multiple structures, each applying to a subset of the dimensions. We present a novel approach to crossclustering, based on approximating the solution to a Cross Dirichlet Process mixture (CDPM) model [Shafto et al., 2006, Mansinghka et al., ...
متن کاملInteractive Bayesian Hierarchical Clustering
Clustering is a powerful tool in data analysis, but it is often difficult to find a grouping that aligns with a user’s needs. To address this, several methods incorporate constraints obtained from users into clustering algorithms, but unfortunately do not apply to hierarchical clustering. We design an interactive Bayesian algorithm that incorporates user interaction into hierarchical clustering...
متن کاملMulti-rank Sparse Hierarchical Clustering
There has been a surge in the number of large and flat data sets – data sets containing a large number of features and a relatively small number of observations – due to the growing ability to collect and store information in medical research and other fields. Hierarchical clustering is a widely used clustering tool. In hierarchical clustering, large and flat data sets may allow for a better co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Multivariate Analysis
سال: 2010
ISSN: 0047-259X
DOI: 10.1016/j.jmva.2010.03.009